feat(platform): add phase8 real load execution evidence pipeline by ActiveInAI · Pull Request #22 · ActiveInAI/ArchIToken

ActiveInAI · 2026-05-01T05:29:21Z

Summary

Adds Phase 8.2 real load execution evidence pipeline so ArchIToken can progress from 100k certification gates to machine-verifiable real execution evidence.

Scope

Extends the Phase 8 load evidence contract with staged smoke/1k/10k/25k/50k/100k execution results.
Requires immutable artifact binding:
- git SHA
- K8s manifest hash
- Docker image digest
- k6 script hash
Requires Prometheus, Grafana, and OpenTelemetry evidence before certification.
Adds Prometheus snapshot collection tooling.
Adds evidence merge tooling for k6 summary + Prometheus snapshot + K8s state + git metadata.
Adds live K8s runtime validation tooling.
Adds staged load execution scripts:
- smoke
- 1k
- 10k
- 25k
- 50k
- 100k
Adds final certification-from-evidence script.
Adds Phase 8.2 docs, bottleneck playbook, and certification report template.
Updates Phase 8.1 documentation to state that PR feat(platform): add phase8 real 100k load certification gates #21 introduced gates only and does not certify real 100k concurrency.

Validation

rm -f 04-backend/openapitools.json
git diff --check
python3 -m unittest tools/test_phase8_load_evidence.py
python3 -m unittest tools/test_phase8_prometheus_snapshot.py
python3 -m unittest tools/test_phase8_merge_load_evidence.py
python3 -m unittest tools/test_phase8_runtime_cluster_validation.py
bash -n 04-backend/scripts/phase8-run-smoke.sh
bash -n 04-backend/scripts/phase8-run-1k.sh
bash -n 04-backend/scripts/phase8-run-10k.sh
bash -n 04-backend/scripts/phase8-run-25k.sh
bash -n 04-backend/scripts/phase8-run-50k.sh
bash -n 04-backend/scripts/phase8-run-100k.sh
bash -n 04-backend/scripts/phase8-certify-from-evidence.sh
cargo fmt --all -- --check
cargo clippy --all-targets --all-features -- -D warnings
cargo test --all-targets --all-features
cargo build --release --bin insomeos-gateway
04-backend/scripts/guard-proprietary-runtime.sh

Boundary

No real external 100k load run was executed in this PR.

This PR does not claim ArchIToken is certified for 100k concurrent users. It adds the execution evidence pipeline required to run, merge, validate, and certify real external 100k load evidence.

No proprietary RealBIM / BlackHole / OptRapid runtime asset is imported into open-source core.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f9426e7968

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-01T05:33:09Z

+        "stage": stage,
+        "start_time": str(metadata.get("start_time", "")).strip(),
+        "end_time": str(metadata.get("end_time", "")).strip(),
+        "vu": int(metadata.get("vu", metric(summary, "vus_max", "max", 0))),


Use observed VUs when computing achieved concurrency

The merge step currently sets each stage vu from stage metadata first, and those metadata files are generated by the run scripts using the target VU constant rather than an observed value. This allows a run that never actually reached 100k VUs to still produce achieved_concurrency=100000 and pass certification gates, because validation only checks the merged achieved_concurrency field. Use k6-observed metrics (for example vus_max) as the source of truth and treat metadata as informational.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-01T05:33:09Z

+        "dropped_connections": int(metrics.get("realtime.dropped_connections", 0)),
+        "gateway_restarts": int(metrics.get("gateway.restarts", 0)),


Fail when restart/drop metrics are missing from Prometheus data

These fields default to 0 when absent, but the bundled snapshot collector (tools/phase8_collect_prometheus_snapshot.py) does not query realtime.dropped_connections or gateway.restarts, so merged evidence will report zero by default and can incorrectly satisfy threshold checks during real instability. This creates false-positive certifications unless missing critical metrics are treated as an error (or explicitly queried and required).

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-01T05:33:09Z

+    pods = [item for item in snapshot.get("items", []) if item.get("kind") == "Pod"]
+    for pod in pods:
+        if not pod_ready(pod):
+            errors.append(f"Pod/{pod.get('metadata', {}).get('name', 'unknown')} is not ready")


Restrict readiness checks to relevant workload pods

The validator marks the snapshot invalid if any pod in the namespace is not Running and ready, including unrelated completed Job/CronJob pods. In a production-like namespace with normal completed maintenance/migration pods, this will block certification even when all required gateway/stateful components are healthy. The readiness loop should scope to the certification workloads (or ignore terminal Succeeded pods) to avoid false failures.

Useful? React with 👍 / 👎.

ActiveInAI added 4 commits May 1, 2026 13:27

feat(platform): strengthen phase8 real load evidence contract

3e45a01

test(platform): add phase8 runtime evidence collection gates

8b5f2cd

test(platform): add phase8 staged real load execution scripts

216241d

docs(platform): document phase8 real load execution evidence

f9426e7

chatgpt-codex-connector Bot reviewed May 1, 2026

View reviewed changes

ActiveInAI merged commit 153aec5 into main May 1, 2026
6 checks passed

ActiveInAI deleted the feat/platform-phase8-real-load-execution-evidence branch May 1, 2026 05:35

ActiveInAI mentioned this pull request May 1, 2026

fix(platform): make phase8 certification evidence fail closed #23

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(platform): add phase8 real load execution evidence pipeline#22

feat(platform): add phase8 real load execution evidence pipeline#22
ActiveInAI merged 4 commits into
mainfrom
feat/platform-phase8-real-load-execution-evidence

ActiveInAI commented May 1, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		"dropped_connections": int(metrics.get("realtime.dropped_connections", 0)),
		"gateway_restarts": int(metrics.get("gateway.restarts", 0)),

Conversation

ActiveInAI commented May 1, 2026

Summary

Scope

Validation

Boundary

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant